智能论文笔记

Scale-Invariant Specifications for \\Human-Swarm Systems

Joel Meyer , Ahalya Prabhakar , Allison Pinosky , Ian Abraham , Annalisa Taylor , Millicent Schlafly , Katarina Popovic , Giovani Diniz , Brendan Teich , Borislava Simidchieva

分类：机器人

2022-12-06

We present a method for controlling a swarm using its spectral decomposition -- that is, by describing the set of trajectories of a swarm in terms of a spatial distribution throughout the operational domain -- guaranteeing scale invariance with respect to the number of agents both for computation and for the operator tasked with controlling the swarm. We use ergodic control, decentralized across the network, for implementation. In the DARPA OFFSET program field setting, we test this interface design for the operator using the STOMP interface -- the same interface used by Raytheon BBN throughout the duration of the OFFSET program. In these tests, we demonstrate that our approach is scale-invariant -- the user specification does not depend on the number of agents; it is persistent -- the specification remains active until the user specifies a new command; and it is real-time -- the user can interact with and interrupt the swarm at any time. Moreover, we show that the spectral/ergodic specification of swarm behavior degrades gracefully as the number of agents goes down, enabling the operator to maintain the same approach as agents become disabled or are added to the network. We demonstrate the scale-invariance and dynamic response of our system in a field relevant simulator on a variety of tactical scenarios with up to 50 agents. We also demonstrate the dynamic response of our system in the field with a smaller team of agents. Lastly, we make the code for our system available.

translated by 谷歌翻译

Scale-Invariant Fast Functional Registration

Muchen Sun , Allison Pinosky , Ian Abraham , Todd Murphey

分类：计算机视觉 | 机器人

2022-09-26

功能配准算法表示点云为函数（例如，空间占用场），避免了常规最小二乘Quares注册算法中不可靠的对应估计。但是，现有的功能注册算法在计算上很昂贵。此外，在基于CAD模型的对象本地化等任务中，必须使用未知量表的注册能力，但是功能注册中没有这种支持。在这项工作中，我们提出了一种比例不变的线性时间复杂性功能配准算法。我们通过使用正顺序基函数在功能之间的L2距离之间有效地近似实现线性时间复杂性。正统基函数的使用导致与最小二乘配准兼容的公式。受益于最小二乘的公式，我们使用翻译反转不变测量的理论来解除尺度估计，从而实现规模不变的注册。我们在标准的3D注册基准上评估了所提出的算法，称为FLS（功能最小二乘），显示FLS的数量级比最先进的功能配准算法快，而无需损害准确性和鲁棒性。 FLS还胜过基于最小二乘的最小二乘注册算法，其精度和鲁棒性具有已知和未知量表。最后，我们证明将FLS应用于具有不同密度和部分重叠的寄存点云，同一类别中不同对象的点云以及带有嘈杂RGB-D测量值的真实世界对象的点云。

translated by 谷歌翻译

Majorization Minimization Methods for Distributed Pose Graph Optimization

Taosha Fan , Todd Murphey

分类：机器人

2021-07-30

我们考虑分布式姿势图优化（PGO）的问题，该问题在多机器人同时定位和映射（SLAM）中具有重要的应用。我们提出了用于分布式PGO（$ \ mathsf {mm \！\！\！\！\！pgo} $）的大量最小化方法（mm）方法，该方法适用于一类宽类强大的损失内核。 $ \ mathsf {mm \！\！ - \！\！pgo} $方法可以在轻度条件下收敛到一阶关键点。此外，请注意$ \ mathsf {mm \！\！ - ！ - \！\！pgo} $方法是让人联想到近端方法，我们利用Nesterov的方法并采用自适应重启来加速收敛。生成的分布式PGO的加速MM方法 - 既有网络中的主节点（$ \ Mathsf {amm \！\！\！\！\！\！！ - \！\！pgo}^{＃} $） - 与$ \ mathsf {mm \！\！\！ - \！\！pgo} $相比，收敛速度更快，而无需牺牲理论保证。特别是，$ \ mathsf {amm \！\！\！ - \！\！ $ \ mathsf {amm \！\！\！\！pgo}^*$使用主节点从所有其他节点汇总信息。这项工作的功效通过对2D和3D SLAM基准数据集的广泛应用以及与现有最新方法的全面比较来验证，这表明我们的MM方法更快地收敛，并为分布式PGO提供更好的解决方案。

translated by 谷歌翻译

Learning from Human Directional Corrections

Wanxin Jin , Todd D. Murphey , Zehui Lu , Shaoshuai Mou

分类：机器人 | 机器学习

2020-11-30

本文提出了一种方法，该方法使机器人能够从人类的定向校正中逐渐学习控制目标函数。现有方法从人类的幅度校正中学习，并且需要人类仔细选择校正幅度，否则可以很容易地导致过度校正和学习效率低下。所提出的方法仅需要人类的定向校正 - 校正，该校正仅指示控制变化的方向，而不会指示其幅度 - 在机器人运动期间的某些时间实例应用。我们仅假设人类的校正，无论其幅度如何，在一个方向上指向机器人当前运动相对于隐含控制目标函数。因此，人类的有效修正总是占校正空间的一半。所提出的方法使用校正的方向来基于切割平面技术更新目标函数的估计。我们建立了理论结果，以证明该过程保证了学习目标函数的收敛到隐含的目标。通过数值例子，对两个人机游戏的用户研究以及真实世界的四轮车实验进行了拟议的方法。结果证实了该方法的收敛性，并表明该方法更有效（成功率较高），有效/轻松（需要较少人力校正），可访问（更少的早期浪费的试验）而不是最先进的机器人交互式学习计划。

translated by 谷歌翻译

Learning from Sparse Demonstrations

Wanxin Jin , Todd D. Murphey , Dana Kulić , Neta Ezer , Shaoshuai Mou

分类：机器人 | 机器学习

2020-08-05

本文开发了连续的蓬松蛋白可区分编程（连续PDP）的方法，该方法使机器人能够从少数稀疏的关键帧中学习目标函数。带有一些时间戳记的密钥帧是所需的任务空间输出，预计机器人将顺序遵循。密钥帧的时间戳可能与机器人的实际执行时间不同。该方法共同找到一个目标函数和一个盘绕函数，以使机器人的产生轨迹顺序遵循关键帧，并以最小的差异损失。连续的PDP通过有效求解机器人轨迹相对于未知参数的梯度，可以最大程度地减少投影梯度下降的差异损失。该方法首先在模拟机器人臂上进行评估，然后应用于6-DOF四极管，以在未建模的环境中学习目标函数。结果表明，该方法的效率，其处理密钥帧和机器人执行之间的时间错位的能力以及将客观学习对看不见的运动条件的概括。

translated by 谷歌翻译

Generalizable Natural Language Processing Framework for Migraine Reporting from Social Media

Yuting Guo , Swati Rajwal , Sahithi Lakamana , Chia-Chun Chiang , Paul C. Menell , Adnan H. Shahid , Yi-Chieh Chen , Nikita Chhabra , Wan-Ju Chao , Chieh-Ju Chao

分类：自然语言处理

2022-12-23

Migraine is a high-prevalence and disabling neurological disorder. However, information migraine management in real-world settings could be limited to traditional health information sources. In this paper, we (i) verify that there is substantial migraine-related chatter available on social media (Twitter and Reddit), self-reported by migraine sufferers; (ii) develop a platform-independent text classification system for automatically detecting self-reported migraine-related posts, and (iii) conduct analyses of the self-reported posts to assess the utility of social media for studying this problem. We manually annotated 5750 Twitter posts and 302 Reddit posts. Our system achieved an F1 score of 0.90 on Twitter and 0.93 on Reddit. Analysis of information posted by our 'migraine cohort' revealed the presence of a plethora of relevant information about migraine therapies and patient sentiments associated with them. Our study forms the foundation for conducting an in-depth analysis of migraine-related information using social media data.

translated by 谷歌翻译

Bayesian Semiparametric Model for Sequential Treatment Decisions with Informative Timing

Arman Oganisian , Kelly D. Getz , Todd A. Alonzo , Richard Aplenc , Jason A. Roy

分类：机器学习 | (统计)机器学习

2022-11-29

We develop a Bayesian semi-parametric model for the estimating the impact of dynamic treatment rules on survival among patients diagnosed with pediatric acute myeloid leukemia (AML). The data consist of a subset of patients enrolled in the phase III AAML1031 clinical trial in which patients move through a sequence of four treatment courses. At each course, they undergo treatment that may or may not include anthracyclines (ACT). While ACT is known to be effective at treating AML, it is also cardiotoxic and can lead to early death for some patients. Our task is to estimate the potential survival probability under hypothetical dynamic ACT treatment strategies, but there are several impediments. First, since ACT was not randomized in the trial, its effect on survival is confounded over time. Second, subjects initiate the next course depending on when they recover from the previous course, making timing potentially informative of subsequent treatment and survival. Third, patients may die or drop out before ever completing the full treatment sequence. We develop a generative Bayesian semi-parametric model based on Gamma Process priors to address these complexities. At each treatment course, the model captures subjects' transition to subsequent treatment or death in continuous time under a given rule. A g-computation procedure is used to compute a posterior over potential survival probability that is adjusted for time-varying confounding. Using this approach, we conduct posterior inference for the efficacy of hypothetical treatment rules that dynamically modify ACT based on evolving cardiac function.

translated by 谷歌翻译

ToDD: Topological Compound Fingerprinting in Computer-Aided Drug Discovery

Andac Demir , Baris Coskunuzer , Ignacio Segovia-Dominguez , Yuzhou Chen , Yulia Gel , Bulent Kiziltan

分类：机器学习 | 人工智能

2022-11-07

In computer-aided drug discovery (CADD), virtual screening (VS) is used for identifying the drug candidates that are most likely to bind to a molecular target in a large library of compounds. Most VS methods to date have focused on using canonical compound representations (e.g., SMILES strings, Morgan fingerprints) or generating alternative fingerprints of the compounds by training progressively more complex variational autoencoders (VAEs) and graph neural networks (GNNs). Although VAEs and GNNs led to significant improvements in VS performance, these methods suffer from reduced performance when scaling to large virtual compound datasets. The performance of these methods has shown only incremental improvements in the past few years. To address this problem, we developed a novel method using multiparameter persistence (MP) homology that produces topological fingerprints of the compounds as multidimensional vectors. Our primary contribution is framing the VS process as a new topology-based graph ranking problem by partitioning a compound into chemical substructures informed by the periodic properties of its atoms and extracting their persistent homology features at multiple resolution levels. We show that the margin loss fine-tuning of pretrained Triplet networks attains highly competitive results in differentiating between compounds in the embedding space and ranking their likelihood of becoming effective drug candidates. We further establish theoretical guarantees for the stability properties of our proposed MP signatures, and demonstrate that our models, enhanced by the MP signatures, outperform state-of-the-art methods on benchmark datasets by a wide and highly statistically significant margin (e.g., 93% gain for Cleves-Jain and 54% gain for DUD-E Diverse dataset).

translated by 谷歌翻译

UNav: An Infrastructure-Independent Vision-Based Navigation System for People with Blindness and Low vision

Anbang Yang , Mahya Beheshti , Todd E Hudson , Rajesh Vedanthan , Wachara Riewpaiboon , Pattanasak Mongkolwat , Chen Feng , John-Ross Rizzo

分类：计算机视觉

2022-09-22

现在，基于视觉的本地化方法为来自机器人技术到辅助技术的无数用例提供了新出现的导航管道。与基于传感器的解决方案相比，基于视觉的定位不需要预安装的传感器基础架构，这是昂贵，耗时和/或通常不可行的。本文中，我们为特定用例提出了一个基于视觉的本地化管道：针对失明和低视力的最终用户的导航支持。给定最终用户在移动应用程序上拍摄的查询图像，该管道利用视觉位置识别（VPR）算法在目标空间的参考图像数据库中找到相似的图像。这些相似图像的地理位置用于采用加权平均方法来估计最终用户的位置和透视N点（PNP）算法的下游任务中，以估计最终用户的方向。此外，该系统实现了Dijkstra的算法，以根据包括Trip Origin和目的地的可通航地图计算最短路径。用于本地化和导航的层压映射是使用定制的图形用户界面构建的，该图形用户界面投影了3D重建的稀疏映射，从一系列图像构建到相应的先验2D楼平面图。用于地图构造的顺序图像可以在预映射步骤中收集，也可以通过公共数据库/公民科学清除。端到端系统可以使用带有自定义移动应用程序的相机安装在任何可互联网的设备上。出于评估目的，在复杂的医院环境中测试了映射和定位。评估结果表明，我们的系统可以以少于1米的平均误差来实现本地化，而无需了解摄像机的固有参数，例如焦距。

translated by 谷歌翻译

On the Factory Floor: ML Engineering for Industrial-Scale Ads Recommendation Models

Rohan Anil , Sandra Gadanho , Da Huang , Nijith Jacob , Zhuoshu Li , Dong Lin , Todd Phillips , Cristina Pop , Kevin Regan , Gil I. Shamir

分类：机器学习

2022-09-12

对于工业规模的广告系统，对广告点击率（CTR）的预测是一个核心问题。广告点击构成了一类重要的用户参与，通常用作广告对用户有用的主要信号。此外，在每次点击收费的广告系统中，单击费用期望值直接输入价值估计。因此，对于大多数互联网广告公司而言，CTR模型开发是一项重大投资。此类问题的工程需要许多适合在线学习的机器学习（ML）技术，这些技术远远超出了传统的准确性改进，尤其是有关效率，可重复性，校准，信用归因。我们介绍了Google搜索广告CTR模型中部署的实用技术的案例研究。本文提供了一项行业案例研究，该研究强调了当前的ML研究的重要领域，并说明了如何评估有影响力的新ML方法并在大型工业环境中有用。

translated by 谷歌翻译